Semi-automatic Ground Truth Generation for Chart Image Recognition
Abstract
Although research on scientific chart recognition is ongoing, there is no suitable standard for evaluating the overall performance of chart recognition results. This paper introduces a system for semi-automatic generation of chart ground truth. With the system, the user can extract multiple levels of ground truth data. The system carries out automatic tasks such as text block detection and line detection, while the user verifies and corrects the results and inputs values where necessary. Compared with fully manual processing, it can effectively reduce the time needed to generate ground truth data. We tested the system on 115 images. The images and the generated ground truth data are publicly available.
Similar resources
Generating Ground Truthed Dataset: Automatic or Semi-automatic?
Ground truthing tools mainly fall into two categories: automatic and semi-automatic. In this paper, we first discuss the pros and cons of the two approaches. We then report our own work on designing and implementing systems for generating a chart image dataset and multilevel ground truth data. Both semi-automatic and automatic approaches were adopted, resulting in two independent systems. The dat...
Efficient Generation of Large Amounts of Training Data for Sign Language Recognition: A Semi-automatic Tool
We have developed a video hand segmentation tool that helps generate hand ground truth from sign language image sequences. This tool may greatly facilitate research in the area of sign language recognition. In this tool, we offer a semi-automatic scheme to assist with the localization of hand pixels, which is important for the purpose of recognition. A candidate hand generator is ap...
StrokeBank: Automating Personalized Chinese Handwriting Generation
Machine learning techniques have been successfully applied to Chinese character recognition; nonetheless, automatic generation of stylized Chinese handwriting remains a challenge. In this paper, we propose StrokeBank, a novel approach to automating personalized Chinese handwriting generation. We use a semi-supervised algorithm to construct a dictionary of component mappings from a small seeding...
Segmentation semi-automatique en plans pour la génération de cartes denses de disparités / Semi-automatic Planar Segmentation Applied to the Generation of Dense Disparity Maps
This work falls within the field of computer vision and, more precisely, planar segmentation applied to the generation of dense disparity maps. The goal is to produce new stereoscopic images with ground truth in order to evaluate and compare stereovision algorithms precisely. We consider piecewise planar scenes and we propose a semi-automatic segmentation method based on the active contour models ...
A Ground Truth Tool for Synthetic Aperture Radar (SAR) Imagery
The performance of Computer Vision algorithms has made great strides and is now good enough to be useful in a number of civilian and military applications. Algorithm advancement in Automatic Target Recognition (ATR), in particular, has reached a critical point. State-of-the-art ATRs are capable of delivering robust performance for certain operational scenarios. As Computer Vision technology matur...
Publication date: 2006